NoSQL Databases for RDF: An Empirical Evaluation
نویسندگان
چکیده
Processing large volumes of RDF data requires sophisticated tools. In recent years, much effort was spent on optimizing native RDF stores and on repurposing relational query engines for large-scale RDF processing. Concurrently, a number of new data management systems— regrouped under the NoSQL (for “not only SQL”) umbrella—rapidly rose to prominence and represent today a popular alternative to classical databases. Though NoSQL systems are increasingly used to manage RDF data, it is still difficult to grasp their key advantages and drawbacks in this context. This work is, to the best of our knowledge, the first systematic attempt at characterizing and comparing NoSQL stores for RDF processing. In the following, we describe four different NoSQL stores and compare their key characteristics when running standard RDF benchmarks on a popular cloud infrastructure using both single-machine and distributed deployments.
منابع مشابه
A MapReduce Approach to NoSQL RDF Databases
In recent years, the increased need to house and process large volumes of data has prompted the need for distributed storage and querying systems. The growth of machine-readable RDF triples has prompted both industry and academia to develop new database systems, called “NoSQL,” with characteristics that differ from classical databases. Many of these systems compromise ACID properties for increa...
متن کاملWorkload-Aware RDF Partitioning and SPARQL Query Caching for Massive RDF Graphs stored in NoSQL Databases
Governments, corporations, startups, open data initiatives and other organizations are increasingly considering RDF and SPARQL in a broad range of information management scenarios. To reduce SPARQL querying times has been the main issue for virtually all the recent RDF triplestores, yet SPARQL caching techniques have not been broadly considered. In this paper we present Rendezvous, a middleware...
متن کاملTranslation of Relational and Non-relational Databases into RDF with xR2RML
With the growing amount of data being continuously produced, it is crucial to come up with solutions to expose data from ever more heterogeneous databases (e.g. NoSQL systems) as linked data. In this paper we present xR2RML, a language designed to describe the mapping of various types of databases to RDF. xR2RML flexibly adapts to heterogeneous query languages and data models while remaining fr...
متن کاملFast In-Memory Reasoner for Oracle NoSQL Database EE: Uncover hidden relationships that exist in your enterprise data
Graph databases and NoSQL databases, two very important topics in Big Data, have gained popularity in recent years due to their unique characteristics in their horizontally scale-out capability and flexible schema or schema-free design. The recent release of OWL-DBC , an adaptor between Oracle Spatial and Graph 2 and the TrOWL reasoner [2, 1], has built a tight integration between one of the le...
متن کاملTowards a Use Case Driven Evaluation of Database Systems for RDF Data Storage - A Case Study for Statistical Data
To store Linked Data one may choose from a growing number of available database systems: from traditional relational databases to RDF triple stores, not to mention the area of NoSQL technologies. Comparisons of database systems often use benchmarks to evaluate systems with the best overall runtime performance. However, the structure of data and queries used in traditional benchmarks differ from...
متن کامل